Abstract



We introduce hierarchical neighbor graphs, a new architecture for connecting ad hoc wireless nodes distributed
in a plane. The structure has the flavor of hierarchical clustering and requires only local knowledge
and minimal computation at each node to be formed and repaired. Hence, it is a suitable interconnection
model for an ad hoc wireless sensor network. The structure is able to use energy efficiently by reorganizing
dynamically when the battery power of heavily utilized nodes degrades and is able to achieve throughput,
energy efficiency and network lifetimes that compare favorably with the leading proposals for data collation
in sensor networks such as LEACH (Heinzelman et al., 2002). Additionally, hierarchical neighbor graphs
have low power stretch, i.e. the power required to connect nodes through the network is a small factor higher
than the power required to connect them directly. Our structure also compares favorably to mathematical
structures proposed for connecting points in a plane, e.g. nearest-neighbor graphs (Ballister et al., 2005) and
$\theta$-graphs (Ruppert and Seidel, 1991), in that it has expected constant degree and does not require any significant
computation or global information to be formed.

1 Introduction

Topology control is a fundamental problem in the study of wireless ad hoc and sensor networks. A primary issue
in this area is that of constructing a connected network between the nodes while keeping in mind the various
constraints wireless devices operate under. Several architectures have been suggested to solve this problem, each
striving to achieve multiple objectives vital to obtaining high throughput and low latency while expending as
little energy as possible. Bounded degree is one such objective, needed to reduce the overhead of channel state
exchange between neighbors when nodes use MIMO antennas [28]. Another important criterion of a good solution
is that the number of hops between nodes be small, because multi-hop wireless networks are error-prone
and the probability of packet loss increases with path length. For wireless nodes with sufficient battery power, a
connected topology must also ensure that the power stretch (the ratio of the power spent by two nodes in connecting
through the network to the power spent in communicating directly) is kept low [13, Chapter 9].

In this paper we propose a novel architecture for connectivity in wireless networks: hierarchical neighbor
graphs. Our structure is a randomized one that effectively combines ideas from skip list data structures [17, 5]
and nearest-neighbor graphs [25, 3] to give a hierarchical bounded degree structure for connecting points in a
plane that has short paths between nodes both in terms of number of hops and distance. Hierarchical neighbor
graphs can be built with local information and without any substantial computation. In order to find connections,
nodes do not need to estimate any angles or distances; they only need to know the relative distance of their
neighbors, which can be easily determined from signal strength. Unlike other nearest-neighbor flavored structures,
hierarchical neighbor graphs are able to incorporate battery power as a parameter and ensure that nodes with low
battery power are not expected to transmit to nodes that are located far away. This property, along with ease of
deployment and reformation as battery power decreases, makes our structure a good candidate for collating data
from a field of wireless sensor nodes. In fact, we demonstrate that our structure is more energy efficient and
has better throughput than one of the leading proposals in this area, LEACH [8], while having network lifetime
comparable to it.
The key idea of our construction is that each node is assigned a level and chooses one neighbor from a level
above it and its other neighbors from its own level. This assignment of levels is done through a random process
which helps ensure that the degree of each node is bounded in expectation no matter what the positions of the
points. For the special case where the locations of the points are generated by a Poisson point process we show
that even when all nodes have equal battery power, the probability of the edges formed being long is very small.
Encouraged by this we define an edge-length bounded version of our structure in which all edges above a certain
length, determined by the battery power of the node, are deleted. We demonstrate that for a Poisson point
process of sufficiently high density, this more practical structure is still connected.

Paper organization and our contributions. In Section 2 we define hierarchical neighbor graphs. We also describe
proactive and reactive routing protocols for this architecture and discuss how it can be easily adapted when
nodes join or leave or when battery power changes. Additionally:

• We show analytically that hierarchical neighbor graphs have bounded degree in expectation (Section 3.1).

• When the locations of the points are determined by a Poisson point process, the probability of long-range
connections being formed is very low. We also define a bounded connection length version of our structure
and show through simulation that it is connected for Poisson point processes of sufficiently high density
(Section 3.2).

• We show analytically that the number of hops between any two nodes in our structure is low (Section 3.3).

• We show through simulation that the distance stretch of our structure has an exponentially decaying 
distribution, decaying faster for pairs of points that are further apart. We show an initial theorem in this 
direction (Section (23). 

• We simulate the use of our architecture to collate data from a field of wireless sensors and show that it
performs better than LEACH [8] in terms of throughput and energy efficiency while matching it in terms of
network lifetime (Section 4).

1.1 Related work 

The area of topology control for wireless ad hoc and sensor networks has seen a lot of activity over the years. 
Several surveys and books are available on this topic (see e.g. [TH [HJ [13]). The proposed architectures range from
the geometry-based Gabriel Graphs [5] and Relative Neighborhood Graphs [22] and their numerous variants to
direction-based proposals like Yao Graphs [26] and their family. Li gives a fairly comprehensive survey of these
kinds of architectures in Chapter 9 of his book [13]. Of particular interest to us in this class of proposals are
$\theta$-graphs [20], in which the area around each node is partitioned into cones of angle $\theta$ and the node is connected
to the nearest neighbor in each cone. These graphs can be shown to be spanners, i.e. the graph distance between
any two points is a bounded factor larger than their Euclidean distance. Arya, Mount and Smid [1] refined this
proposal in a flavor very similar to ours by combining skip lists with $\theta$-graphs to propose skip list spanners,
which have the spanning property of $\theta$-graphs but have fewer hops between all pairs of nodes ($O(\log n)$ hops for
a point set with $n$ nodes) than $\theta$-graphs have. While the proposal of Arya et al. seems very similar to ours, it has
one major disadvantage that it shares with all the other structures of the family of geometric graphs: constructing
these kinds of graphs normally involves computation involving the relative position of a point and its neighbors 
and an area surrounding these points. We note that our hierarchical neighbor graphs avoid the computation 
overhead that these kinds of construction require. This also makes them superior to the class of topology control 
proposals that are based on computing dominating sets or spanning trees (see e.g. [23]), computations which 
are time consuming and construct structures which are difficult to adapt in dynamic situations. This class of 
proposals contains several variants, too numerous to recount here, so we refer the reader to an excellent survey 
by Santi [H]. 

Clustering based approaches to topology control have also been studied extensively in the literature. Some of 
the common assumptions and algorithmic features of these proposals are location unawareness, periodic selection 
of cluster heads, equal battery power but multi-power transmission ability. The systems are evaluated by 
evaluating throughput and network lifetime under the assumption that battery power is not replenished. Within 
these constraints Younis and Fahmy proposed a distributed clustering approach, HEED, for maximizing the lifetime
of ad hoc sensor networks. This approach was extended by Huang and Wu, with an additional assumption
of uniform distribution of nodes [10]. Basagni gave a method of choosing cluster heads based on a node-mobility
parameter [4]. Heinzelman, Chandrakasan and Balakrishnan's architecture, LEACH, is an application-specific
protocol for ad hoc wireless sensors [8]. Cluster heads receive data from sensor nodes and compress it before
sending it to the base station. Similar to our approach, LEACH incorporates randomized rotation of the high-energy
cluster head role among sensors, keeping the expected number of cluster heads in the network constant. By contrast,
in our approach the rotation is implicit and does not require additional computation. Also, for LEACH, nodes
must have an estimate of the energy remaining in the network at the end of each round. Another assumption LEACH
makes that we do not share is that the number of nodes in the network and the number of cluster heads must be
known to every node. In Section [3] we compare our structure to LEACH and find that under similar assumptions
regarding transmission energy, hierarchical neighbor graphs provide better throughput and energy efficiency
with matching network lifetime. We note that systems like LEACH attempt to save power by randomly rotating
cluster head responsibilities. A more sophisticated approach involves choosing nodes with greater residual energy
to perform cluster head roles (see e.g. [MIIIZDj), and this is the paradigm in which hierarchical neighbor graphs
fall.

Another method related to our approach, in the sense that clustering is implicit, is CLUSTERPOW, introduced
by Kawadia and Kumar [11]. No leader or gateway is explicitly selected; rather, the clustered structure of the
network is manifested in the way routing is done. Each node can transmit at a finite number of different power
levels. A route is a non-increasing sequence of power levels.

On the mathematical front, nearest neighbor models have been studied by Haggstrom and Meester [7] who
proved that there is a critical value $k_c$ dependent on the dimension of the space such that if each point is
connected to its $k$ nearest neighbors for any $k > k_c$, an infinite component exists almost surely. Restricting this
model to a square box of area $n$, Xue and Kumar [25] showed that $k$ must be at least $0.074 \log n$ for the graph to
be connected. This lower bound was improved to $0.3043 \log n$ by Ballister, Bollobas, Sarkar and Walters [3] who
also showed that the threshold for connectivity is at most $0.5139 \log n$, as well as corresponding results for the
directed version of the problem. The advantage our model enjoys over this model is that by selecting neighbors
carefully from the set of proximate nodes, hierarchical neighbor graphs achieve constant degree in expectation.
Additionally our hierarchical structure also ensures paths with fewer hops.

Finally, we end by mentioning that hierarchical neighbor graphs bear more than a passing resemblance to 
the skip list data structure [17] and its biased version [2].

2 Hierarchical neighbor graphs 

In this section we define hierarchical neighbor graphs and some special versions of them. We also describe routing
protocols for these structures and methods for adapting them in dynamic settings. We will use the notation that
for $u, v \in \mathbb{R}^2$, $d(u, v)$ is the Euclidean distance between the points.

Consider a set of points $V \subset \mathbb{R}^2$. We are given a function $w : V \to \mathbb{R}^+$ such that each node $u \in V$ has a
battery power $c \cdot w(u)$ associated with it, where $c$ is a constant determined by the minimum battery power a
node needs to operate, i.e. $c$ ensures that any functioning node has weight at least 1. Taking a parameter $p$ such
that $0 < p < 1$, we form the $p$-hierarchical neighbor graph on $V$ with weight function $w$, denoted $HN^w_p(V)$, as
follows:

1. We create a sequence $\{S_i : i \ge 0\}$ of subsets of $V$ such that $S_0 = V$. The points of a set $S_{i-1}$ are
"promoted" to $S_i$ in two ways, one deterministic and one randomized.

• Deterministically: all $u \in S_{i-1}$ with $\lfloor \log_{1/p} w(u) \rfloor \ge i$ are put into $S_i$.

• The remaining points of $S_{i-1}$ are placed in $S_i$ with probability $p$, independently of the choice of all
other points.

2. After obtaining the sequence of sets, we say that the level of $u$ is $lev_p(u) = i$ such that $u \in S_i$ and $u \notin S_{i+1}$.

3. Each point $u$ with $lev_p(u) = i$ grows a circle around it which stops growing the first time a point $v$ with $lev_p(v) > i$
is encountered. This point is called $parent(u)$. The point $u$ makes connections to all nodes $w$ with $lev_p(w) = i$ that
lie within this circle and to the node(s) of $S_{i+1}$ that lie on the circumference of the circle. Note that this
algorithm is fully distributed in nature. All connections are made through local interactions.



Algorithm construct($HN_p(V) = (V, EN_p)$)

1. $EN_p \leftarrow \emptyset$

2. At each node $u$ do

(a) Determine $lev_p(u) = i$ such that $u \in S_i$ and $u \notin S_{i+1}$.

(b) Find the nearest neighbor $z \in S_{lev(u)+1}$ of $u$;
$parent(u) \leftarrow z$;
$EN_{lev(u)} \leftarrow EN_{lev(u)} \cup \{(u, parent(u))\}$;

(c) For each $v \in S_{lev(u)} \setminus S_{lev(u)+1}$ such that $d(u, v) < d(u, parent(u))$, i.e. $v$ is closer to $u$ than
$parent(u)$:
$EN_{lev(u)} \leftarrow EN_{lev(u)} \cup \{(u, v)\}$

3. $EN_p \leftarrow EN_p \cup \bigcup_{i \ge 0} EN_i$.

Figure 1: Building subgraph $HN_p(V)$.



The algorithm for forming $HN_p(V)$ is described in Figure 1.

For the special case where $w(u) = 1$ for all $u \in V$ we denote the structure thus formed $HN_p(V)$.
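The construction can be sketched in a few lines of Python. This is a minimal, centralized illustration of the level assignment and the parent and same-level connection rules, not the distributed protocol itself. The function names `assign_levels` and `build_hn_graph` are hypothetical, and the treatment of top-level nodes (which have no parent, so their circle is taken to be unbounded) is an assumption made for the sketch.

```python
import math
import random

def assign_levels(weights, p, rng):
    """Return lev_p(u) for each node: deterministic promotions from the
    weight, followed by independent coin flips with success probability p."""
    levels = []
    for w in weights:
        # floor(log_{1/p} w) deterministic promotions (epsilon guards float error)
        level = max(0, math.floor(math.log(w) / math.log(1.0 / p) + 1e-9))
        # each further promotion succeeds independently with probability p
        while rng.random() < p:
            level += 1
        levels.append(level)
    return levels

def build_hn_graph(points, levels):
    """Return (edges, parent) for the given points and levels."""
    edges, parent = set(), {}
    n = len(points)
    for u in range(n):
        higher = [v for v in range(n) if levels[v] > levels[u]]
        if higher:
            # parent(u): nearest node of a strictly higher level
            z = min(higher, key=lambda v: math.dist(points[u], points[v]))
            parent[u] = z
            radius = math.dist(points[u], points[z])
            edges.add(frozenset((u, z)))
        else:
            # assumption: a top-level node has no parent, so its circle never stops
            radius = math.inf
        for v in range(n):
            # same-level nodes strictly inside the circle become neighbors
            if v != u and levels[v] == levels[u] and math.dist(points[u], points[v]) < radius:
                edges.add(frozenset((u, v)))
    return edges, parent

points = [(0.0, 0.0), (1.0, 0.0), (3.0, 0.0)]
edges, parent = build_hn_graph(points, [0, 0, 1])
```

In the small example, both level-0 nodes pick node 2 as their parent and also connect to each other, since each lies inside the other's circle.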

2.0.1 Radius-bounded hierarchical neighbor graphs 

We define a variant of hierarchical neighbor graphs, called radius-bounded hierarchical neighbor graphs, which
take another function $r : V \to \mathbb{R}^+$ as a parameter. We denote this graph built on a point set $V$ with weight
function $w$ and parameter $p$ by $HN^{w,r}_p(V)$. The function $r$ limits the transmission radius of the nodes, i.e. node $u$
is not allowed to connect to any node $v$ such that $d(u, v) > r(u)$. Hence $HN^{w,r}_p(V)$ is a subgraph of $HN^w_p(V)$ and
may not be connected. This model is, however, a more practical one, since it does not assume that nodes can
form connections with distant neighbors. As before, we denote by $HN^r_p(V)$ the special case of $HN^{w,r}_p(V)$ where
all weights are 1.

2.0.2 Repairing hierarchical neighbor graphs 

Note that hierarchical neighbor graphs are adaptable in the sense that should a node $u$'s power decrease to $p$
times its original amount, its level can be readjusted by reducing the deterministic part of its promotion by 1,
without touching the probabilistic promotions. This leads to an overall decrease of 1 in $lev_p(u)$. Similarly if the
power available with a node increases to $1/p$ times the original amount, the deterministic part of the promotion
of a node can be increased by 1, thereby increasing $lev_p(u)$ by 1. Clearly when either of these happens the
structure requires some repair, which we discuss next.

Node addition or battery power gain. When a node $u$ is added to $V$, it first determines $lev_p(u)$. Once its
membership in sets $S_0, \ldots, S_{lev_p(u)}$ is established, all points $v$ with $lev_p(v) < lev_p(u)$ whose radius of connection
contains $u$ truncate their radius of connection and make $u$ their parent. Nodes $v$ with $lev_p(v) = lev_p(u)$ form
connections to $u$ if it lies within their radius of connection. All other nodes are unaffected. Similarly when a
node $u$ gains battery power, $lev_p(u)$ increases from $i$ to $j$ and nodes of $S_i, S_{i+1}, \ldots, S_j$ may have to truncate
their radius of connection if $u$ lies within it.

Node departure or battery power loss. When a node $u$ departs from $V$, all nodes that have $u$ as their
parent have to expand their radius of connection to find a new parent. Nodes in $S_{lev_p(u)}$ that were connected
to $u$ simply note its departure but don't have to change anything. Similarly when the battery power of a node
$u$ in $HN^w_p(V)$ decreases so that $lev_p(u)$ drops from $i$ to $j$, nodes of $S_k \setminus S_{k+1}$ that had $u$ as their parent have to increase their radius
and find a new parent, for $j \le k \le i - 1$. Nodes of $S_j \setminus S_{j+1}$ simply note the departure of $u$ from $S_j$.



2.1 Routing 



We describe two approaches to routing on $HN_p(V)$, one proactive and one reactive. Before we do that, let us
define some terms. The ancestor of $u$ at level $i$, $anc_i(u)$, is the node of level at least $i$ reachable from $u$ through
a series of edges connecting nodes to their parents. The connected component of $u$ at level $i$, for $i \le lev_p(u)$,
denoted $conn_p(u, i)$, is the connected component containing $u$ of the subgraph of $HN_p(V)$ comprising only edges
between vertices $v \in V$ such that $lev_p(v) \ge i$, i.e. it is the set of vertices that are promoted at least up to level
$i$ and that are reachable from $u$ without the path having to touch any node of highest level strictly less than $i$.

2.1.1 Proactive routing 

In this method we extend the established ideas of distance vector based approaches. The algorithm proceeds
in rounds, one for each level of the hierarchy, consolidating information from below and passing it up. The
protocol is described in Figure 2. The idea is that at each level $i$ each node $v$ such that $lev_p(v) \ge i$ computes
a distance vector to all the nodes in its connected component. But along with the distances to a node $v$ in its
component, each node $u$ also receives a distance vector directory of all nodes of levels lower than $i$ that can be
reached through $v$ without visiting any node with highest level $i$ or above. The node $u$ first uses this information
to update its distance vector since it is possible that a node with a highest level lower than $i$ is better reached
through some other node at level $i$ rather than through links going down into the hierarchy below $u$. It then
consolidates its distance vector information and sends it to $parent(u)$. The process then repeats at $lev(u)$.

Algorithm hierarchicalDistanceVector($HN_p(V) = (V, EN_p)$)

1. $i \leftarrow 0$

2. For all $u \in V$, $0 \le j \le lev_p(u)$: $directory_p(u, j) \leftarrow \emptyset$.

3. While there is a node $u$ with $lev(u) \ge i$ which is not marked visited

(a) Run a Distance Vector algorithm for $conn_p(u, i)$ in which each node $v$ exchanges
$directory_p(v, i)$ with all other nodes.

(b) For each $v \in conn_p(u, i)$ augment $directory_p(v, i)$ using the information gathered from the
previous step and mark $v$ visited.

(c) Send $directory_p(v, i)$ to $parent(v)$.

(d) Recompute $directory_p(parent(v), i)$ by adding the information in $directory_p(v, i)$.

(e) Increment $i$.

Figure 2: Constructing a routing table proactively in $HN_p(V)$.

When a packet needs to be sent from $u$ to $v$, $u$ discovers the best route by checking its directory at level 0.
If it doesn't find $v$ listed it queries $parent(u)$, which checks the directory at level 1, and so on till we reach a level
$i$ where $anc_i(u)$'s directory contains an entry for $v$. The route discovered first goes from $u$ to $anc_i(u)$ and then
proceeds to the node of $S_i$ listed as the next hop for $v$ in $anc_i(u)$'s directory.
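The lookup described above can be illustrated with a short Python sketch. It assumes the directories have already been built and are stored in a dictionary keyed by (node, level) that maps each known destination to a next hop; this layout, the function name `discover_route` and the toy data are hypothetical choices made only for illustration.

```python
def discover_route(u, dest, levels, parent, directory):
    """Climb the ancestors of u until some directory lists dest.

    Returns (climb, next_hop), where climb is the list of nodes visited
    on the way up, or None if no ancestor knows a route to dest.
    """
    node, i, climb = u, 0, [u]
    while True:
        next_hop = directory.get((node, i), {}).get(dest)
        if next_hop is not None:
            return climb, next_hop
        i += 1
        # anc_i(u) stays the same node while that node belongs to S_i;
        # otherwise the query moves to its parent
        if levels[node] < i:
            if node not in parent:
                return None  # reached the top of the hierarchy without a match
            node = parent[node]
            climb.append(node)

levels = {"a": 0, "b": 1, "c": 0}
parent = {"a": "b", "c": "b"}
directory = {("a", 0): {}, ("b", 1): {"c": "c"}}
route = discover_route("a", "c", levels, parent, directory)
```

Here node a does not know c at level 0, so the query moves to its parent b, whose level-1 directory lists c.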

2.1.2 Reactive routing 

The problem with the proactive routing scheme is that it requires a lot of storage at each level and computation 
in the set up phase. There is also the problem of keeping the routing tables consistent with the current state of 
the network, which might be expensive in a situation which evolves quickly. For situations where storage may 
not be abundant, e.g. ad hoc sensor networks, and situations where nodes come and go quickly, e.g. mobile 
networks, we now propose a reactive routing scheme which uses a hierarchical controlled flooding method for 
route discovery. 

The process of discovering a route from $u$ to $v$ proceeds in two kinds of phases, up the hierarchy and down the
hierarchy. At first the source node $u$ floods $conn_p(u, 0)$ to find if $v$ is in it. If it doesn't find it, it sends a message
to $parent(u)$. This node now floods $conn_p(parent(u), 1)$ asking for a route to $v$. Each node $x \in conn_p(parent(u), 1)$
sends messages down to all $w$ s.t. $parent(w) = x$ asking them to flood their components at level 0 in search of
$v$. Subsequently, if $v$ is not found in $conn_p(parent(u), 1)$, a message is sent to $parent(parent(u))$ and it launches
a flood at level 2, and so on. We do not give a formal description of the algorithm here, pointing instead to
Figure 3 to give the reader an idea of the region of $HN_p(V)$ that gets flooded in the process of route discovery
between $u$ and $v$. We believe it is possible to argue that the number of nodes flooded in this process is $o(n)$ but
we do not have a proof for this.

Figure 3: If there is an $i$ such that $anc_i(u) \in conn(anc_i(v), i)$ then only the connected components lying at
various levels beneath the vertices of $conn(anc_i(v), i)$ get flooded.

3 Properties of hierarchical neighbor graphs 
3.1 Bounded degree 

We first show that for any point set $V$ the expected degree of the subgraph $HN_p(V)$ is $O\left(\frac{1}{p(1-p)}\right)$. The proof
of this can easily be adapted to show degree bounds for the case where weights are general, i.e. $HN^w_p(V)$. We
also show that the degree bound for $HN_p(V)$ can be improved to $O(1/p)$ when $V$ is generated by a Poisson point
process.

Theorem 3.1 The expected degree of any point $v \in V$ in $HN_p(V)$ constructed with parameter $p$ on an arbitrary
point set $V$ is at most

$$\frac{1}{p} + \frac{6}{p(1-p)}.$$

Proof. Let us consider some point $v \in V$. For the purposes of analysis we think of the graph as directed and
distinguish between outgoing edges, i.e. the edges established by $v$, and incoming edges, i.e. the edges established
to $v$ by other points of $V$. We account for these two sets separately.

Outgoing edges. Let us assume that $v$ is promoted up to level $i$. Clearly $v$ establishes no outgoing edges till this
level. At this level it stops being promoted and searches for the point nearest to it that is promoted to level
$i + 1$. Let us name the points of $S_i$ ordered by increasing distance from $v$ as $u_1, u_2, \ldots, u_{|S_i|}$. Clearly for $v$ to
have $k$ outgoing edges, the first $k - 1$ vertices in this sequence must not be promoted past level $i$ and the $k$th
vertex must be promoted to level $i + 1$. Hence

$$P(v \text{ has } k \text{ outgoing edges}) = (1 - p)^{k-1} \cdot p.$$

Therefore

$$E(\text{No. of outgoing edges from } v) \le \sum_{k=1}^{|S_i|} k \cdot (1 - p)^{k-1} \cdot p \le \frac{1}{p}.$$
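The bound of $1/p$ on the expected number of outgoing edges can be checked by extending the sum to infinity and differentiating the geometric series. The following worked step is a sketch that uses only the quantities already defined in the proof, with $x$ as an auxiliary variable:

```latex
\sum_{k=1}^{|S_i|} k\,(1-p)^{k-1}\,p
  \;\le\; p \sum_{k=1}^{\infty} k\,x^{k-1}\Big|_{x = 1-p}
  \;=\; p \cdot \frac{d}{dx}\left(\frac{1}{1-x}\right)\Big|_{x = 1-p}
  \;=\; \frac{p}{\bigl(1-(1-p)\bigr)^{2}}
  \;=\; \frac{1}{p}.
```

For example, with $p = 0.5$ a node is expected to establish at most 2 outgoing edges.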

Incoming edges. Now let us consider a level $j \le i$, i.e. a level at which $v$ exists and may be connected to by other
vertices of $V$. We denote by $X_j$ the number of edges established into $v$ by points that belong to $S_j \setminus S_{j+1}$, i.e. by
points that are promoted up to $S_j$ but not beyond. Note that at level $j$ the candidates to establish edges to $v$
are looking for a vertex of level $j + 1$ to connect to. These vertices establish an edge with $v$ only if none
of the vertices nearer to them than $v$ have been promoted to level $j + 1$.

In order to analyze this situation, let us consider the sequence $\sigma(v, j) = u_1, u_2, \ldots, u_{|S_j|}$ of vertices of $S_j$
ordered by their increasing distance from $v$. We partition the space around $v$ into six cones subtending angles of
$\pi/3$ at $v$ and index these six partitions with the numbers 1 to 6. Further we partition the sequence $\sigma(v, j)$ into six
subsequences $\sigma(v, j, i) = u^i_1, u^i_2, \ldots$, $1 \le i \le 6$, where the points of $\sigma(v, j, i)$ lie entirely within cone $i$. As is clear




Figure 4: Note that cannot have an edge to u if either or u\ have a higher level. 

visually from Figure 4, using elementary geometry we can claim that the distance between two points within a
cone is less than the distance between the further of these two points and $v$, i.e.

Claim 3.2 Given two points $u^i_k$ and $u^i_\ell$ such that $k < \ell$,

$$d(u^i_k, u^i_\ell) < d(v, u^i_\ell).$$

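Note that for $v$ to have at least $m$ incoming edges, at least $m/6$ of these edges must come from points of one of the
six subsequences, say $\sigma(v, j, i^*)$. In view of Claim 3.2 this means that if the $k$th point of $\sigma(v, j, i^*)$ establishes an
edge to $v$, then none of the first $k - 1$ points of $\sigma(v, j, i^*)$ can have been promoted to level $j + 1$. Hence
we can say that for $v$ to have at least $m$ incoming edges, the first $\lceil m/6 \rceil$ points of $\sigma(v, j, i^*)$ must not be promoted
to level $j + 1$, i.e.

$$P(v \text{ has at least } m \text{ incoming edges}) \le (1 - p)^{\lceil m/6 \rceil}. \qquad (1)$$

Therefore

$$E(\text{No. of incoming edges to } v \text{ at level } j) \le \sum_{m=1}^{\infty} (1 - p)^{\lceil m/6 \rceil} \le 6 \sum_{t=1}^{\infty} (1 - p)^{t} \le \frac{6}{p}.$$

What we have shown above is that $E(X_j) \le \frac{6}{p}$ for $j \ge 0$. Also, we know that for $X_j$ to be non-zero, $v$ must be
promoted at least till level $j$, which happens with probability $p^j$. With this in view we calculate $E\left(\sum_{j \ge 0} X_j\right)$,
i.e. the expectation of the total number of edges incoming to $v$. So, we have

$$E\Big(\sum_{j \ge 0} X_j\Big) \le \sum_{j \ge 0} p^j \cdot \frac{6}{p} = \frac{6}{p(1 - p)}.$$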

□ 

The proof described above can easily be adapted to show the following theorem for $HN^w_p(V)$ for a general
weight function:

Theorem 3.3 The expected degree of any point $v \in V$ in $HN^w_p(V)$ constructed with parameter $p$ on an arbitrary
point set $V \subset \mathbb{R}^2$ is at most

$$\frac{1}{p} + \frac{6}{p}\left(\log_{1/p} w(v) + \frac{1}{1 - p}\right).$$

We omit the proof here because it essentially follows the proof of Theorem 3.1, pointing out to the reader that
inequality (1) holds as a pessimistic estimation of the probability in the weighted case, since the deterministic
promotion of weighted nodes would make them higher in general than nodes of weight 1.






For point sets generated by a Poisson point process we show a better bound of $O(1/p)$. We note that it is quite
expected that the degree bound does not contain the intensity $\lambda$ of the point process, since $HN_p(V)$ is
essentially a nearest neighbor model.

Theorem 3.4 The expectation of the degree of $HN_p(V)$ constructed with parameter $p$ on a Poisson point process
$V$ with density $\lambda$ is at most $\frac{7}{p}$.

Proof. As before we account separately for outgoing edges and incoming edges. We inherit the bound for
outgoing edges from the proof of Theorem 3.1, where the setting is more general. We focus here on improving
the bound for incoming edges.

Consider a vertex $v$ that has been promoted up to level $i$. Consider a level $j \le i$. We denote by $X_j$ the
number of edges established into $v$ by points that belong to $S_j \setminus S_{j+1}$, i.e. by points that are promoted up to $S_j$
but not beyond.

We partition the space around $v$ into six cones subtending angles $\pi/3$ at $v$ and number them from 1 to 6.
Consider a point $u$ at a distance $r$ from $v$ in cone $k$, $1 \le k \le 6$. Let $X_j^k$ denote the number of edges into $v$ from vertices in cone
$k$. By the symmetry of the cones and linearity of expectation we can say that

$$E(X_j) = 6 \cdot E(X_j^1).$$

So, we focus on one of these cones and compute an upper bound on the expected number of edges incoming to
$v$ from this cone. In view of Claim 3.2, for a point $u \in S_j \setminus S_{j+1}$ to connect to $v$ all the points of $S_j$ in cone 1
which are nearer to $v$ than $u$ must not be promoted to level $j + 1$, i.e. they must all belong to $S_j \setminus S_{j+1}$. Note
that this is an upper bound since we disregard the points outside the cone which might be closer to $u$ than $v$ is
and which might be promoted to $S_{j+1}$, and also the points within the cone but further from $v$ than $u$ which might
be closer to $u$ than $v$ is and which might be promoted to $S_{j+1}$. Both these kinds of points would prevent $u$ from
sending an edge into $v$ but we disregard them since we are only looking for an upper bound.

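In order to compute this upper bound on $E(X_j^1)$ we consider a segment of the cone at a distance $r$ from $v$
with infinitesimal width $dr$. Since the area of this strip is infinitesimal, the probability that there are 2 or more
points in this strip is $o(dr)$ and hence can be neglected. Hence the expected number of edges from this strip
into $v$ can be upper bounded by the probability that there is an edge from within this strip going into $v$. This is
computed by computing the intersection of the events "the $(i+1)$st nearest neighbor of $v$ in cone 1 is in the strip
of width $dr$ at distance $r$" and "none of the $i$ points of $S_j$ in the sector of radius $r$ in cone 1 are promoted to
level $j + 1$". Since these two events happen on disjoint areas they are independent. To compute the expectation
of $X_j^1$ we simply integrate over all values of $r$ from 0 to $\infty$ for each value of $i$ and sum over all values of $i$ from
0 to $\infty$. Writing $\mu = \lambda p^j \pi / 6$, so that the number of points of $S_j$ in the sector of radius $r$ in cone 1 is Poisson with
mean $\mu r^2$, this gives

$$E(X_j^1) \le \sum_{i=0}^{\infty} \int_0^{\infty} e^{-\mu r^2} \frac{(\mu r^2)^i}{i!} (1 - p)^i \cdot (1 - p)\, \lambda p^j \frac{\pi}{3}\, r\, dr = \frac{1 - p}{p}.$$

Combining this with $E(X_j) = 6 \cdot E(X_j^1)$ and the fact that $v$ is promoted at least till level $j$ with probability $p^j$, we get that

$$E\Big(\sum_{j \ge 0} X_j\Big) \le \frac{1}{1 - p} \cdot 6 \cdot \frac{1 - p}{p} = \frac{6}{p}.$$

□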



As before, there is a theorem analogous to Theorem 3.4 for the case where weights are general. We state
that theorem here, noting that we do not use the terminology of a point process in this setting since that would have,
in general, an infinite number of points and if the weight of each of these points were considered to be at least
some $\epsilon > 0$, that would amount to an assumption of infinite energy.

Theorem 3.5 The expected degree of any point $v \in V$ in $HN^w_p(V)$ constructed with parameter $p$ on a finite set
of points $V$ distributed uniformly at random in a bounded area $A \subset \mathbb{R}^2$ is at most



The proof of this theorem follows the proof of Theorem 3.3 and is omitted here.

Discussion. The result of Theorem 3.4 assumes significance when seen in light of Ballister et al.'s result on
connectivity of $k$-nearest neighbor graphs [3]. They considered $k$-nearest neighbor graphs on point sets in $\mathbb{R}^2$, i.e.
each node establishes edges with its $k$ nearest neighbors. Improving on a result of Xue and Kumar [25], Ballister
et al. showed that for a Poisson point process with $\lambda = 1$ on a square of area $n$, the probability of the $k$-nearest
neighbor graph being connected tends to 0 as $n \to \infty$ for $k < \lfloor 0.3043 \log n \rfloor$. In contrast, our $p$-hierarchical
neighbor graphs achieve connectivity, even within a square of area $n$, with an expected degree not depending on
$n$.

Also, we note that nodes in structures like $\theta$-graphs [HI HO] and Yao graphs [22] have constant outgoing
degree but may have arbitrarily high incoming degree. In fact several papers have been devoted to constructing
constant degree versions of these structures at some computational expense (see [13, Chapter 9] for a thorough
treatment). Theorem 3.1 does not preclude the possibility of a node having high degree in hierarchical neighbor
graphs, since the bound on degree is only in expectation, but it has the advantage that the kinds of positioning of
points that prove pathological for $\theta$-graph-like structures still retain some probability of having bounded node
degree, at no extra computation cost.

3.2 Expected edge length in $HN_p(V)$

Since nodes in wireless networks are driven by batteries of limited power, their transmission ranges are finite. A
network architecture ignoring this limitation is impractical. In order to show that hierarchical neighbor graphs
are sensitive to this constraint we will show that the probability of long connections being formed in $HN_p(V)$ is
very low when $V$ is generated by a Poisson point process of density $\lambda$. As a consequence of this we expect that
for every value of $r > 0$, the radius-bounded hierarchical neighbor graph $HN^{r}_p(V)$ is connected as long as $\lambda$ is
sufficiently high. We demonstrate by simulation that this is indeed the case. In fact, for every $r > 0$ there is a
value $\lambda_{\min}(r)$ such that $HN^{f,r}_p(V)$ is connected for all $\lambda > \lambda_{\min}(r)$, for the special weight function $f$ that assigns
weight 1 to all nodes.

We begin by bounding the probability that 2 nodes which are distance $l$ apart are connected directly.

Proposition 3.6 Consider $HN_p(V)$ constructed on a point set $V$ generated by a Poisson point process of density
$\lambda$. Given any two nodes $u, v \in V$ with $d(u, v) = l$, the probability that $u$ forms an edge with $v$ or vice versa in
$HN_p(V)$, denoted g^(l), is upper bounded as



Proof. We use g^(l) to represent this probability. Consider 2 nodes $u$ and $v$ s.t. $lev_p(u) = i$, $lev_p(v) = j$ and
$d(u, v) = l$. Note that if $i < j$, $u$ connects to $v$ iff there is no node $w$ s.t. $lev_p(w) > i$ and $d(u, w) < d(u, v)$. If
$i = j$, $u$ and $v$ have an edge iff there is no node $w$ s.t. $lev_p(w) > i$ and $d(u, w) < d(u, v)$ or $d(w, v) < d(u, v)$.

oo 

OO 

oo 

Now we upper bound the summation by integrating over the function $p^{2x} e^{-\lambda \pi l^2 p^{x+1}}$, giving us the result. □

In the radius-bounded hierarchical neighbor graph $\mathrm{HN}_p^{w,r}(V)$ we go a step further towards incorporating real-life constraints and remove all connections between nodes more than a certain distance apart. Our simulations showed that $\mathrm{HN}_p^{w,r}(V)$ built on uniformly and randomly distributed points is still connected for all $r > 0$, provided the density of these points exceeds a minimum value that depends on $r$. To investigate the relationship between this minimum density ($\lambda_{\min}$) and the transmission radius ($r$), we simulated $\mathrm{HN}_p^{w,r}(V)$, increasing the density of the points until we achieved a connected network. In order to verify that the value we determined as $\lambda_{\min}$ was not an anomaly, we ran the simulation for 10 increments of $\lambda$ past the first value where a connected network was achieved, and only fixed $\lambda_{\min}$ when the network was found to be connected for all these 10 increments. Figure 5 shows the dependency of $\lambda_{\min}$ on $r$. We fixed $p = 0.5$. The point set was scattered randomly within a $10 \times 10$ square. The Y-axis shows $\lambda_{\min}$ (above which the network is connected), plotted against $r$, the maximum transmission radius allowed. We found the relation to be of the form $\lambda_{\min} \cdot r^2 = c$, $c$ being a constant. This is reminiscent of the critical phenomena of unit disk graphs built on Poisson point processes. If the radius of each disk is $r$, there is a critical density $\lambda_c(r)$ above which the unit disk graph has an infinite component, and it can be shown that $\lambda_c(r) \cdot r^2$ is a constant for $r > 0$ (see [15] for details). We believe a similar theorem may hold for $\mathrm{HN}_p(V)$.

Figure 5: For $\mathrm{HN}_{0.5}^{w,r}(V)$ the curve is $\lambda_{\min} r^2 = 3.62$ (best fit).
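The density search described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the simulator used for Figure 5: levels are drawn with probability $p^i(1-p)$ for level $i$ (unit weights), each node links to its nearest node of strictly higher level, top-level nodes are pairwise linked, and the number of points is taken to be the rounded value of density times area rather than a Poisson sample. The function names `sample_levels`, `hn_edges`, `is_connected` and `min_density` are hypothetical.

```python
import math
import random
from collections import deque


def sample_levels(n, p, rng):
    """Draw a level for each node: level i with probability p**i * (1 - p)."""
    levels = []
    for _ in range(n):
        lev = 0
        while rng.random() < p:
            lev += 1
        levels.append(lev)
    return levels


def hn_edges(points, levels, r=math.inf):
    """Edges of the sketch: every node links to its nearest node of strictly
    higher level, top-level nodes are pairwise linked, and edges longer than
    r are dropped (the radius-bounded variant)."""
    top = max(levels)
    edges = set()
    for u, (pu, lu) in enumerate(zip(points, levels)):
        if lu == top:
            targets = [v for v in range(len(points))
                       if v != u and levels[v] == top]
        else:
            higher = [v for v in range(len(points)) if levels[v] > lu]
            targets = [min(higher, key=lambda v: math.dist(pu, points[v]))]
        for v in targets:
            if math.dist(pu, points[v]) <= r:
                edges.add((min(u, v), max(u, v)))
    return edges


def is_connected(n, edges):
    """Breadth-first search from node 0 over an undirected edge set."""
    adj = [[] for _ in range(n)]
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    seen = {0}
    queue = deque([0])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return len(seen) == n


def min_density(r, p=0.5, side=10.0, step=0.1, confirm=10,
                max_density=50.0, seed=0):
    """Raise the density until the graph is connected, then require it to stay
    connected for `confirm` further increments before accepting the value."""
    rng = random.Random(seed)
    lam, first, streak = step, None, 0
    while lam <= max_density:
        n = max(1, round(lam * side * side))  # assumption: n = density * area
        pts = [(rng.uniform(0, side), rng.uniform(0, side)) for _ in range(n)]
        lv = sample_levels(n, p, rng)
        if is_connected(n, hn_edges(pts, lv, r)):
            if streak == 0:
                first = lam
            streak += 1
            if streak > confirm:
                return first
        else:
            streak = 0
        lam += step
    return None  # no stable connected density found below max_density
```

The edge construction is quadratic in the number of points, which is acceptable for a sketch but would need a spatial index for dense point sets. Repeating `min_density` over a range of radii and fitting $\lambda_{\min} r^2 = c$ would mirror the experiment described above.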

3.3 Number of hops 

To bound communication delay in networks, one must construct a topology with a small hop-stretch factor. Hop spanners were introduced by Peleg and Ullman [16] and were used as network synchronizers. In this section we analyze the hop stretch in hierarchical neighbor graphs.

For finite point sets and Poisson point processes limited to finite regions it is easy to show exponential decay of the number of hops between points. We introduce the notation $\mathrm{ht}(\mathrm{HN}_p^w(V)) = \max_{u \in V} \mathrm{lev}_p(u)$. For finite point sets, we claim the following theorem, which follows easily from standard skip list analysis.



Theorem 3.7 For $\mathrm{HN}_p^w(V)$ constructed on a finite set $V$, define $W(V) = \sum_{u \in V} w(u)$. For $k > \max_{u \in V} \log_{1/p} w(u)$,

$$P(\mathrm{ht}(\mathrm{HN}_p^w(V)) > k) \le W \cdot p^k.$$

Proof. For a given node $u \in V$, the probability that $\mathrm{lev}_p(u) \ge k$, for any $k > \max_{u \in V} \lceil \log_{1/p} w(u) \rceil$, is $p^{k - \log_{1/p} w(u)}$. Since the height of $\mathrm{HN}_p^w(V)$ is at least $k$ if there is at least one node with level at least $k$, applying the union bound on probabilities we get that

$$P(\mathrm{ht}(\mathrm{HN}_p^w(V)) > k) \le \sum_{u \in V} p^{k - \log_{1/p} w(u)} = W \cdot p^k. \qquad \Box$$

The nodes at level $\mathrm{ht}(\mathrm{HN}_p^w(V))$ are fully connected to each other (since they are not able to find a node at a higher level). This means that in $\mathrm{HN}_p^w(V)$ on a finite set $V$, for any two nodes to connect to each other the path has to travel at most $O(\mathrm{ht}(\mathrm{HN}_p^w(V)))$ levels up and down the hierarchy. Hence Theorem 3.7 implies that the number of hops between any two nodes has an exponentially decaying distribution.
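As a quick numerical illustration of the height bound, the following sketch estimates the tail of the height by Monte Carlo and compares it with $W \cdot p^k$. It assumes unit weights, so that $W$ equals the number of nodes and each node reaches level at least $k$ with probability $p^k$; the function names are hypothetical.

```python
import random


def height(n, p, rng):
    """Maximum level over n nodes, each promoted repeatedly with probability p."""
    best = 0
    for _ in range(n):
        lev = 0
        while rng.random() < p:
            lev += 1
        best = max(best, lev)
    return best


def height_tail(n, p, k, trials=2000, seed=1):
    """Empirical estimate of P(height > k) over independent trials."""
    rng = random.Random(seed)
    hits = sum(height(n, p, rng) > k for _ in range(trials))
    return hits / trials


def union_bound(n, p, k):
    """The bound W * p**k with unit weights (W = n), capped at 1."""
    return min(1.0, n * p ** k)


if __name__ == "__main__":
    for k in range(8, 15):
        print(k, height_tail(100, 0.5, k), union_bound(100, 0.5, k))
```

With unit weights the exact tail is $1 - (1 - p^{k+1})^n$, so the empirical column should sit below the bound and both should shrink geometrically in $k$.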

For point processes limited to finite regions, the following theorem gives us the result that the number of 
hops between points decays exponentially: 

Theorem 3.8 Given a finite region $A \subset \mathbb{R}^2$ with area $\ell(A)$, and a set of points $V$ in this region generated by a Poisson point process of density $\lambda$, then for any $k > 0$,

$$P(\mathrm{ht}(\mathrm{HN}_p(V)) > k) \le \lambda \cdot \ell(A) \cdot p^k.$$

The proof follows easily by conditioning on the number of points in $A$ and using the argument from the proof of Theorem 3.7 to bound the conditional probability of the height being greater than $k$. We omit the details. Theorem 3.8 can be extended to general weights as well, but we would have to define carefully how the weights are distributed by the Poisson point process. We omit that case here since it is essentially a mathematical digression.
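The conditioning step can be sketched as follows. This is an illustrative outline rather than the omitted proof itself: it assumes unit weights, so that the bound of Theorem 3.7 reads $n \cdot p^k$ when there are $n$ points, and it writes $N$ (a symbol introduced here) for the number of points of the process in $A$, which is Poisson with mean $\lambda \cdot \ell(A)$.

```latex
\begin{align*}
P\bigl(\mathrm{ht}(\mathrm{HN}_p(V)) > k\bigr)
  &= \sum_{n \ge 0} P(N = n)\,
     P\bigl(\mathrm{ht}(\mathrm{HN}_p(V)) > k \mid N = n\bigr) \\
  &\le \sum_{n \ge 0} P(N = n)\, n\, p^{k}
   = E[N]\, p^{k}
   = \lambda \cdot \ell(A) \cdot p^{k}.
\end{align*}
```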

For Poisson point processes in $\mathbb{R}^2$ we found through simulation that the number of hops varies logarithmically with the distance between the pair of points being connected. We do not have an analytical proof of this fact as yet.



3.4 Bounding the stretch 

A major concern of topology control mechanisms is that the graph be a spanner, i.e. given a point set $V$ and an interconnection structure $G$, if we denote the shortest distance between points $u, v \in V$ along the edges of $G$ by $d_G(u,v)$, the ratio

$$\delta_G = \max_{u,v \in V} \frac{d_G(u,v)}{d(u,v)},$$

known as the distance stretch of $G$, should be low. For example, $\theta$-graphs have distance stretch $\frac{1}{1 - 2\sin(\theta/2)}$ [20]. The power stretch of $G$ is defined as the ratio of the power expended by communicating through the links of $G$ to the power expended in communicating directly. The power stretch of $G$ is known to be upper bounded by $\delta_G^{\beta}$ [14], where $\beta$ is a constant between 2 and 5 that depends on the medium; hence we only consider distance stretch here.
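To make the definition concrete, the distance stretch of a small geometric graph can be computed with Dijkstra's algorithm run from every vertex. The sketch below is generic and makes no claim about how the simulations in this paper were implemented; it assumes distinct points and an undirected edge list, and the function names are hypothetical.

```python
import heapq
import math


def graph_distances(points, edges, source):
    """Shortest-path distances from source, using Euclidean edge lengths."""
    adj = {i: [] for i in range(len(points))}
    for u, v in edges:
        w = math.dist(points[u], points[v])
        adj[u].append((v, w))
        adj[v].append((u, w))
    dist = {source: 0.0}
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, math.inf):
            continue  # stale heap entry
        for v, w in adj[u]:
            nd = d + w
            if nd < dist.get(v, math.inf):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


def distance_stretch(points, edges):
    """Maximum over pairs of graph distance divided by Euclidean distance.
    Returns math.inf if some pair is disconnected."""
    worst = 1.0
    for u in range(len(points)):
        dist = graph_distances(points, edges, u)
        for v in range(u + 1, len(points)):
            ratio = dist.get(v, math.inf) / math.dist(points[u], points[v])
            worst = max(worst, ratio)
    return worst


if __name__ == "__main__":
    # Three sides of a unit square: the two end corners are at Euclidean
    # distance 1 but graph distance 3.
    square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
    path = [(0, 1), (1, 2), (2, 3)]
    print(distance_stretch(square, path))  # 3.0
```

For this example the power stretch bound $\delta_G^{\beta}$ with $\beta = 2$ evaluates to 9.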

In order to study the spanner properties of hierarchical neighbor graphs, we performed a simulation and computed the distance stretch of $\mathrm{HN}_p(V)$ constructed on a set of points randomly distributed on a torus of unit area. In Figure 6 we plot the graph distance of pairs of points of $V$ that have a Euclidean distance of 0.1. We found that the log of the number of pairs of vertices stretched to extent $s$ decreases linearly with $s$, which leads us to believe that the probability of a pair of vertices being stretched decays exponentially with the distance stretch value. We also found that the number of pairs with lower stretch values dominates the number of pairs with higher stretch values as the distance between the pairs increases. These observations lead us to the conjecture that the stretch for pairs of points in $\mathrm{HN}_p(V)$ follows a probability distribution that decays exponentially in the stretch value and also decreases as the distance between the pairs increases. This effectively means that distant points undergo very little distortion in their distance when connected through $\mathrm{HN}_p(V)$, although nearby points may occasionally have to connect to each other through a long route.

Analytically, we were able to prove an initial theorem that confirms our simulation results, but gives a weaker 
result. 
Figure 6: Stretch graphs for pairs with $d(u,v) = 0.1$, $\lambda = 500$.



Theorem 3.9 Given parameters $p$ and $\lambda$ s.t. $0 < p < 1$ and $\lambda > 0$, the graph $\mathrm{HN}_p(V)$ built on a set of points $V$ generated by a Poisson point process with intensity $\lambda$ has the property that for any two points $u, v \in V$ such that $d(u,v) = l$, there are positive constants $c_1$ and $c_2 < 1$ depending only on $p$ and $\lambda$ such that for $0 < \theta < \pi/3$,

Before proceeding with the proof of Theorem 3.9 we need some preliminaries. Given a point set $V$, our algorithm for constructing $\mathrm{HN}_p(V)$ defines a probability space $(\Omega, \mathcal{F}, P)$. A realization of this process, i.e. an element of $\Omega$, is of the form $\omega = (V = S_0, S_1, \ldots)$. Let us denote the $i$th set in this tuple as $\omega(i)$. We define a partial order on the set of all realizations as follows: $\omega \preceq \omega'$ if $\omega(i) \subseteq \omega'(i)$ for all $i \ge 0$. A random variable $N$ defined on this probability space is called an increasing random variable if $N(\omega) \le N(\omega')$ whenever $\omega \preceq \omega'$. Similarly, a random variable $N$ is called decreasing if $-N$ is increasing. An event is called increasing if its indicator is an increasing random variable. For our setting this definition allows us to state a specific version of the more general FKG inequality [5].

Lemma 3.10 For any two events $A$ and $B$ that are either both increasing or both decreasing,

$$P(A \cap B) \ge P(A) \cdot P(B),$$

where $P(\cdot)$ is the probability measure defined on $\Omega$.

Now we turn to the main proof. 
Proof. In order to prove this theorem we draw on the paper on $\theta$-graphs by Ruppert and Seidel [20], in which they extended earlier work by Keil [12] and showed that paths that stay within a cone of angle $\theta$ centered around the line joining a point to its destination have stretch at most $1/(1 - 2\sin(\theta/2))$. The proof proceeds by calculating a lower bound on the probability that there is a path between any two points that stays within a cone of angular width $\theta$.

Given two points $u$ and $v$, we define the $\theta$-cone of $u$ w.r.t. $v$ as a sector of a circle of radius $d(u,v)$ centered at $u$ with an angle $\theta$ which is bisected by the line segment joining $u$ and $v$. We denote this cone $\nabla_v^{\theta}(u)$. We define a $\theta$-good path between vertices $u$ and $v$ as a path $u = u_0 \leftrightarrow u_1 \leftrightarrow \cdots \leftrightarrow u_n = v$ such that $u_{i+1}$ is in $\nabla_v^{\theta}(u_i)$.
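The cone and path definitions above translate directly into a membership test. The following sketch is an illustration only: it treats points as coordinate pairs, takes the cone to be closed (boundary points count as inside), and uses hypothetical function names.

```python
import math


def in_theta_cone(u, v, w, theta):
    """True if w lies in the sector of radius d(u, v) centered at u whose
    angle theta is bisected by the segment from u to v."""
    duv = math.dist(u, v)
    duw = math.dist(u, w)
    if duw == 0 or duw > duv:
        return False
    ang_v = math.atan2(v[1] - u[1], v[0] - u[0])
    ang_w = math.atan2(w[1] - u[1], w[0] - u[0])
    # Smallest absolute angular difference, folded into [0, pi].
    diff = abs((ang_w - ang_v + math.pi) % (2 * math.pi) - math.pi)
    return diff <= theta / 2


def is_theta_good_path(path, theta):
    """True if every step of the path stays in the cone of the current
    point with respect to the final destination path[-1]."""
    dest = path[-1]
    return all(in_theta_cone(path[i], dest, path[i + 1], theta)
               for i in range(len(path) - 1))


def stretch_bound(theta):
    """The bound 1 / (1 - 2 sin(theta / 2)), meaningful for 0 < theta < pi/3."""
    if not 0 < theta < math.pi / 3:
        raise ValueError("theta must lie strictly between 0 and pi/3")
    return 1.0 / (1.0 - 2.0 * math.sin(theta / 2.0))
```

For instance, with destination $(1, 0)$ and $\theta = \pi/3$, a step from the origin to $(0.5, 0.1)$ stays inside the cone, whereas a step to $(0.5, 0.5)$ leaves it because its direction deviates by $\pi/4 > \theta/2$.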

We also define events that will be relevant to us:

• $G^{\theta}_{uv}$: There is a $\theta$-good path between vertices $u$ and $v$.

• $G^{\theta}_{uv}(i,j)$: There is a $\theta$-good path between vertices $u$ and $v$ conditioned on the fact that $\mathrm{lev}(u) = i$ and $\mathrm{lev}(v) = j$.

• $G_{u \leftrightarrow v}$: The event that there is a direct edge between vertices $u$ and $v$.

• $G_{u \leftrightarrow v}(i,j)$: The event that there is a direct edge between vertices $u$ and $v$ conditioned on the fact that $\mathrm{lev}(u) = i$ and $\mathrm{lev}(v) = j$.

Our proof proceeds by computing the probability that the two points are either directly connected or $u$ is connected to a point $w$ in $\nabla_v^{\theta}(u)$ which then has a $\theta$-good path joining it to $v$. Note that the second case is an intersection event. Hence in order to proceed we will need the following lemma, which shows that the two events being intersected are positively correlated.

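Lemma 3.11 Given a set of points $X$ generated by a Poisson point process in $\mathbb{R}^2$, consider three points $u, v, w \in X$ such that $w \in \nabla_v^{\theta}(u)$. Then

$$P\Bigl(\bigcup_{k=0}^{\infty} \{G_{u \leftrightarrow w}(i,k) \cap G^{\theta}_{wv}(k,j)\}\Bigr) \ge \sum_{k=0}^{\infty} P(G_{u \leftrightarrow w}(i,k)) \cdot P(G^{\theta}_{wv}(k,j)).$$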

Proof. We proceed by showing that for a given realization of the Poisson point process the events we are considering are both decreasing. Hence we can apply the FKG inequality to these events conditioned on a particular realization, then decondition to get the result. Let us formalize this.

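Following [15] we view the Poisson point process $X$ in the following way: Consider the family of boxes

$$K(n, z = (z_1, z_2)) = \left(\frac{z_1}{2^n}, \frac{z_1 + 1}{2^n}\right] \times \left(\frac{z_2}{2^n}, \frac{z_2 + 1}{2^n}\right] \quad \text{for all } n \in \mathbb{N},\ z \in \mathbb{Z}^2.$$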



For a given realization of $X$, each point $x \in X$ is contained in a unique box $K(n, z(n,x))$ for each $n \in \mathbb{N}$. With probability 1 there is a unique smallest number $n(x)$ such that $K(n(x), z(n(x), x))$ contains no other points of $X$. Hence we can view the realizations of $X$ in terms of sets of boxes taken from

$$\bigcup_{n \in \mathbb{N}} \bigcup_{z \in \mathbb{Z}^2} K(n, z),$$

such that each box contains exactly one point, i.e. we have a measurable mapping from the probability space of the Poisson point process to a space $\Omega_1$ of such sets of boxes, and an outcome $\omega \in \Omega_1$ is a set of boxes containing exactly one point each.

In order to describe the probability space on which $\mathrm{HN}_p(X)$ is defined, we observe that each realization of $\mathrm{HN}_p(X)$ can be completely described by the levels of the points of a particular realization of $X$. Defining $\Omega_2 = \mathbb{N} \cup \{0\}$ we get that $\mathrm{HN}_p(X)$ is defined on a product space $\Omega = \Omega_1 \times \Omega_2$.

Now, let us consider a given $\omega \in \Omega_1$ and two points $u, v \in X$ for this outcome. Let us denote by $G(\omega)_{u \leftrightarrow v}(i,j)$ the event that, for the realizations mapped to $\omega$, $u$ and $v$ are connected by a direct edge, conditioned on the fact that $\mathrm{lev}(u) = i$ and $\mathrm{lev}(v) = j$. We claim that this is a decreasing event. To see this, note that the event $G(\omega)_{u \leftrightarrow v}(i,j)$ depends only on the levels of all the points of $\omega$ apart from $u$ and $v$. Consider two outcomes $\alpha$ and $\alpha'$ contained in the subspace of $\Omega$ defined by $\omega$ and $\{\mathrm{lev}(u) = i, \mathrm{lev}(v) = j\}$. Assume that $\alpha \preceq \alpha'$ as defined at the beginning of this section.

Let us assume that $i < j$. Clearly if $G(\omega)_{u \leftrightarrow v}(i,j)$ does not occur in $\alpha$, there is some point $w \in \omega$ such that $\mathrm{lev}(w)(\alpha)$, i.e. the value of $\mathrm{lev}(w)$ in outcome $\alpha$, is strictly greater than $i$ and $d(u,w) < d(u,v)$. Since $\alpha \preceq \alpha'$, this point $w$ exists in $\alpha'$ and $\mathrm{lev}(w)(\alpha') \ge \mathrm{lev}(w)(\alpha) > i$, and hence there cannot be a direct edge from $u$ to $v$ in $\alpha'$ either. If $i = j$, we can find two such points that prevent a direct edge being formed between $u$ and $v$, and exactly the same argument holds. We state this as a proposition.

Proposition 3.12 Given an $\omega \in \Omega_1$ and two points $u, v$ of the Poisson point process, $G(\omega)_{u \leftrightarrow v}(i,j)$ is a decreasing event for all integers $i, j \ge 0$.

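Again starting with an $\omega \in \Omega_1$ and a sequence of points $x = x_1, \ldots, x_n$, $n \ge 2$, of a realization of the Poisson point process mapped to $\omega$, we define the event

$$A(\omega)_x = \bigcap_{j=1}^{n-1} G(\omega)_{x_j \leftrightarrow x_{j+1}},$$

and given another sequence $i = i_1, \ldots, i_n$, $n \ge 2$, of non-negative integers, we define

$$A(\omega)_x(i) = \{A(\omega)_x \mid \mathrm{lev}(x_j) = i_j,\ 1 \le j \le n\}.$$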

In other words, the event $A(\omega)_x(i)$ is the event that the points of $x$ form a path in $\mathrm{HN}_p(X)$ conditioned on their levels being fixed. We claim that events of the form $A(\omega)_x(i)$ are decreasing events.

Proposition 3.13 Given an $\omega \in \Omega_1$ and a sequence of points $x = x_1, \ldots, x_n$, $n \ge 2$, of the point process, $A(\omega)_x(i)$ is a decreasing event for every sequence $i = i_1, \ldots, i_n$, $n \ge 2$, of non-negative integers.

This is not difficult to see since the event $A(\omega)_x(i)$ is a finite intersection of the kind of events we argued were decreasing in Proposition 3.12.

Before proceeding we observe that a bounded area around $\nabla_v^{\theta}(u)$ is all that we are considering, since points outside a certain region do not affect the events we are discussing. In the following, when we talk about an outcome $\omega \in \Omega_1$, we will in fact only be referring to that outcome limited to this bounded region.

Now we turn to the left hand side of the inequality in the statement of the lemma and note that it can be rewritten as

$$\sum_{\omega \in \Omega_1} P\Bigl(\bigcup_{k=0}^{\infty} \{G(\omega)_{u \leftrightarrow w}(i,k) \cap G(\omega)^{\theta}_{wv}(k,j)\}\Bigr) \cdot P(\omega), \qquad (2)$$

where $P(\omega)$ is the probability of a particular outcome. Note that since we have restricted our outcomes to a finite region containing $\nabla_v^{\theta}(u)$, the probability $P(\omega)$ is non-zero.

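We observe that the event $G(\omega)^{\theta}_{wv}(k,j)$ can be written as a union of events of the form $A(\omega)_x(i)$ where $x_1 = w$ and $x_n = v$, as long as all the points of $x$ have the property of $\theta$-good paths. To write this formally we define $\sigma^n(\omega)(w,v)$ to be all the sequences $x$ of points that have these properties, i.e. $x_1 = w$, $x_n = v$, $x_i \in \nabla_v^{\theta}(x_{i-1})$, $2 \le i \le n$. Also we use the notation $\mathbb{N}_0^n = \prod_{j=1}^{n} (\mathbb{N} \cup \{0\})$. Hence we get

$$G(\omega)^{\theta}_{wv}(k,j) = \bigcup_{n=2}^{\infty} \ \bigcup_{i \in \mathbb{N}_0^{n-2}} \ \bigcup_{x \in \sigma^n(\omega)(w,v)} A(\omega)_x(i).$$

Putting this into (2) we get that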



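$$\begin{aligned}
\mathrm{LHS} &= \sum_{\omega \in \Omega_1} P\Bigl(\bigcup_{k=0}^{\infty} \bigcup_{n=2}^{\infty} \ \bigcup_{i \in \mathbb{N}_0^{n-2}} \ \bigcup_{x \in \sigma^n(\omega)(w,v)} \{G(\omega)_{u \leftrightarrow w}(i,k) \cap A(\omega)_x(i)\}\Bigr) \cdot P(\omega) \\
&= \sum_{\omega \in \Omega_1} \sum_{k=0}^{\infty} \sum_{n=2}^{\infty} \ \sum_{i \in \mathbb{N}_0^{n-2}} \ \sum_{x \in \sigma^n(\omega)(w,v)} P(\{G(\omega)_{u \leftrightarrow w}(i,k) \cap A(\omega)_x(i)\}) \\
&\qquad \cdot P(\omega) \cdot P(\mathrm{lev}(w) = k) \cdot P(\mathrm{lev}(x_j) = i_j,\ 2 \le j \le n-1).
\end{aligned}$$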
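From Propositions 3.12 and 3.13 and the FKG inequality (Lemma 3.10) we get that

$$\begin{aligned}
\mathrm{LHS} &\ge \sum_{\omega \in \Omega_1} \sum_{k=0}^{\infty} \sum_{n=2}^{\infty} \ \sum_{i \in \mathbb{N}_0^{n-2}} \ \sum_{x \in \sigma^n(\omega)(w,v)} P(G(\omega)_{u \leftrightarrow w}(i,k)) \cdot P(A(\omega)_x(i)) \\
&\qquad \cdot P(\omega) \cdot P(\mathrm{lev}(w) = k) \cdot P(\mathrm{lev}(x_j) = i_j,\ 2 \le j \le n-1) \\
&\ge \sum_{\omega \in \Omega_1} \sum_{k=0}^{\infty} P(G(\omega)_{u \leftrightarrow w}(i,k)) \cdot P(\mathrm{lev}(w) = k) \cdot P(\omega) \\
&\qquad \cdot \sum_{\omega \in \Omega_1} \sum_{n=2}^{\infty} \ \sum_{i \in \mathbb{N}_0^{n-2}} \ \sum_{x \in \sigma^n(\omega)(w,v)} P(A(\omega)_x(i)) \cdot P(\omega) \cdot P(\mathrm{lev}(x_j) = i_j,\ 2 \le j \le n-1) \\
&= \mathrm{RHS}.
\end{aligned}$$

□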



Since the underlying point set $V$ is produced by a stationary process, the probability of having a particular path between a pair of points a certain distance apart is the same as the probability of having the same kind of path for another pair which is the same distance apart. Hence we define the following functions, which we will need:

• $g(l) = P(G^{\theta}_{uv} \mid d(u,v) = l)$.

• $g(i,j,l) = P(G^{\theta}_{uv} \mid d(u,v) = l,\ \mathrm{lev}_p(u) = i,\ \mathrm{lev}_p(v) = j)$.

• $g^{\leftrightarrow}(l) = P(G_{u \leftrightarrow v} \mid d(u,v) = l)$.

• $g^{\leftrightarrow}(i,j,l) = P(G_{u \leftrightarrow v} \mid \mathrm{lev}_p(u) = i,\ \mathrm{lev}_p(v) = j,\ d(u,v) = l)$.

We start by looking at $g(i,j,l)$ for some $i, j \ge 0$. We divide this probability into two cases: the first being that there is a direct edge between $u$ and $v$, and the second that there is a direct edge between $u$ and a point $w$ of level $k$ which further has a $\theta$-good path to $v$. We sum this probability over all possible values of $k$. Note that when $i < k$, $u$ connects to $w$ when looking for a point of level $i + 1$, and when $i > k$, $w$ connects to $u$ when looking for a point of level $k + 1$. In case $w$ gives an incoming edge to $u$, we sum over the probability that the closest neighbor of $u$ belonging to level $k$ has a $\theta$-good path to $v$. We get that

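$$\begin{aligned}
P(G^{\theta}_{uv}(i,j)) &= P(G_{u \leftrightarrow v}(i,j)) + (1 - P(G_{u \leftrightarrow v}(i,j))) \cdot P\Bigl(\bigcup_{w \in \nabla_v^{\theta}(u)} G_{u \leftrightarrow w} \cap G^{\theta}_{wv}\Bigr) \\
&= P(G_{u \leftrightarrow v}(i,j)) + (1 - P(G_{u \leftrightarrow v}(i,j))) \cdot P\Bigl(\bigcup_{w \in \nabla_v^{\theta}(u)} \bigcup_{k \ge 0} \{G_{u \leftrightarrow w}(i,k) \cap G^{\theta}_{wv}(k,j)\}\Bigr) \\
&= P(G_{u \leftrightarrow v}(i,j)) + (1 - P(G_{u \leftrightarrow v}(i,j))) \\
&\qquad \cdot \int_0^{d(u,v)} P\Bigl(\bigcup_{k \ge 0} \{G_{u \leftrightarrow w}(i,k) \cap G^{\theta}_{wv}(k,j) \mid x < d(u,w) < x + dx\}\Bigr) \\
&\qquad \cdot P(\exists w \in \nabla_v^{\theta}(u),\ x < d(u,w) < x + dx).
\end{aligned}$$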

Now since we are considering infinitesimal widths, the probability that there is more than one point at a distance between $x$ and $x + dx$ from $u$ in $\nabla_v^{\theta}(u)$ is $o(dx)$, so we can neglect it. Hence we get

$$P(G^{\theta}_{uv}(i,j)) = \int_0^{d(u,v)} P\Bigl(\bigcup_{k \ge 0} \{G_{u \leftrightarrow w}(i,k) \cap G^{\theta}_{wv}(k,j) \mid x < d(u,w) < x + dx\}\Bigr) \cdot \lambda \theta x \, dx.$$

Now, using the result of Lemma 3.11 we get

$$P(G^{\theta}_{uv}(i,j)) \ge \sum_{k=0}^{\infty} \int_0^{d(u,v)} P(G_{u \leftrightarrow w}(i,k) \mid x < d(u,w) < x + dx) \cdot P(G^{\theta}_{wv}(k,j) \mid x < d(u,w) < x + dx) \cdot \lambda \theta x \, dx.$$

For k < i we have 

pa 

p(gL(*,j))>5] / 



k=0 

and for k > i 



P(Gl(z,j))>^/ e-^-^'p" -e-^^-'p'^^exdx. 
k=o-^° 



Let us assume that $d(u,v) = l$. In the previous equation, if $d(w,v) = \delta$ we get



For n > m 

Putting these values in we get 



fc=i+l 



{/I * / 1 - p-2TTl^p^ \ 
i^^'E ( 1 •/(l-p)-5(^,J,^-'5) 



^ p'^(l-p)-g(fc,j,/-<5) 



fc=i+l 



47r \ 

We use the simple fact that for fc < i 
to get 



[ ^ k=Q ^ 



e / 1 - e~2A7rZ-p 



47r I p'+i 



^ p'=(l-p).g(fc,j,Z-<5) . 

In order to simplify this we prove a small lemma:

Lemma 3.14 Given a parameter $p$ s.t. $0 < p < 1$ and two sequences, $\{a_n = 1/p^n : n \ge 0\}$ and $\{b_n : n \ge 0\}$, which take non-negative values:

p'{l-p) 



E '^'^ • ^ 1 _ „,+i • E • E 

ri=0 ^ \ri=0 / \m=0 

Proof. We start from the right hand side of the inequality: 

i i i i i 

E ' E 



n=0 n=0 



E^» 


■ bn - 


i-E^» 


■ E 


E^» 

n=0 


■ K ~ 


i 

^E^» 


i 

■ ^ am 

m— 1 


E^n 

ri=0 


■ K - 


^E^» 


■E — 

m— 1 


i 

E«» 

ri=0 


■ K - 


i 

^E^» 

ri=0 


1 -p' 


(1 -p)p'- 



Now, since $\{a_n\}$ is an increasing sequence whose minimum value is 1, achieved at $n = 0$, we can say that



1 — p- 

On ^ > a„ ■ o„ + - 
^ — ' r. 

n=0 n=0 

+p'{l-p) 



< 



Using the inequality of Lemma 3.14 we get



\ fc — fij — 

oo ^ 

J2 p\l~p)-g{k,j,l-5) \-{\-g^{i,j,l)) 



2^ _ g-2A7r;2p'+i \ °o 



47r \ 



k=i+l 



+ 1 1^ • (1 - e---^^-^^ ) . ■±p\l-p)-g{k,j,l^5) 



pHl~P)-9{k.J,l~5) \-{l~g^{i,j,l)) 

k=i+l ) 



47r \ / 

We now get a form which is easier to handle i.e. 

ft °° 
J, > g^{t,j, I) + •(! - J, /))— (l - e-2^-'V+^ j . ^/(i _ ; _ s). 

k=0 

Observing that $g(l) = \sum_{i=0}^{\infty} \sum_{j=0}^{\infty} p^i (1-p)\, p^j (1-p)\, g(i,j,l)$, and noting that for $i < j$,
the inequality of ^ gives us 

oo oo 
i=0 j=0 



2=0 i=o fc=o 



2=0 j=i k=0 



Observe that

$$g^{\leftrightarrow}(l) = \sum_{i=0}^{\infty} \sum_{j=0}^{\infty} p^i (1-p)\, p^j (1-p)\, g^{\leftrightarrow}(i,j,l).$$

With this and the simple fact that for j < i 



we get 

oo „ 
1=0 

We begin by lower bounding $g^{\leftrightarrow}(l)$.

oo oo 

oo oo oo 



^ \ -L f } 

i— j— i i— 

1=0 



(l-p2).(^p2,:g-A.;V+i) 



Now we lower bound the summation by integrating over the decreasing part of the function $p^{2x} e^{-\lambda \pi l^2 p^{x+1}}$, after which we get

We now turn to the coefficient of $g(l - \delta)$ and observe that,





/•oo 


) >- 









ln(l/p) \ 2XnPp X-Kpp SXnPp 

Note that 

3A7r?2p - 2XTrPp 

which gives us the recursion 



' ~ [XttPpY Att ln(l/p) \ AttZ^p / ^ 



(5) 



Consider the function 



(l-p) f 1 - e-^'^'^P 



ln(l/p) \ XttI^p J 

For $a(p)$ to be greater than some constant $\eta > 0$ we require



(6) 



Under the condition that 

we get that (6) is satisfied whenever

A./^,(l-,.i^)>l. 



We denote the least value of $l$ for which this inequality is true by $\gamma(\eta)$, i.e.



Now, in order to lower bound the probability of $u$ and $v$ having a $\theta$-good path, we choose an $\eta > 0$ satisfying (7) and unfold the recursion obtained in (5) until the path either connects directly to $v$ or reaches a point $w$ such that

$d(w,v) < \gamma(\eta)$. Observing that $\frac{1}{(\lambda \pi l^2 p)^2} \le \frac{1}{(\lambda \pi (l - \delta)^2 p)^2}$, and that the number of edges we may have in the path is finite but unbounded, we get a geometric series sum when we open the recursion, giving us

where $l^* < \gamma(\eta)$. Let us assume that the point we have reached is $w$ with $l^* = d(u,w) < \gamma(\eta)$. From this point we only consider the case that $w$ connects directly to $v$. Using the lower bound on $g^{\leftrightarrow}$ obtained earlier, we get that



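$$g^{\leftrightarrow}(l^*) \ge \frac{(1 - 3e^{-2})(1 - p^2)}{(\lambda \pi {l^*}^2 p)^2} \ge \frac{(1 - 3e^{-2})(1 - p^2)}{(\lambda \pi \gamma(\eta)^2 p)^2}.$$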

Substituting this in (9) we get

(l-3e-2)(i_p2) (i_3e-2)(i_p2) ^ 



5(0 > 



(A7r7(77)2p)2 (A7r/2p)2 1 _ |£. 



Substituting the value of $\gamma(\eta)$ we get



, ln(l/p) V (l-3e-2)2(l^p2)2 1 

'^'^-''""■T^j T^' 



□ 



4 Hierarchical neighbor graphs as a clustering mechanism for WSNs 

In this section we simulate the situation where a hierarchical neighbor graph architecture is used as a clustering mechanism to collect data from a field of ad hoc wireless sensor devices. We compare the performance of our architecture to that of LEACH [5] under assumptions similar to theirs and find that hierarchical neighbor graphs compare favorably.

The quality of a clustering architecture that collects data from wireless sensor networks is measured by determining: (a) Data throughput. In certain sensor network applications, the amount of raw data that is communicated may not be as important as the information content, or the effective data, since the data from certain nodes may be redundant. Hence we consider effective data throughput as a measure of network performance. (b) Energy efficiency, i.e. the energy expended in transmitting a certain amount of data to the base station. (c) Network lifetime. Often it is not possible to recharge node batteries. We therefore study the network lifetime, which we define here as the time it takes for all nodes in the network to die out.

We consider a typical field of wireless sensors with a set of nodes $V$ in which a large number of sensor nodes continuously sense data, process it and communicate the information to an external sink or base station (BS). The sensors coordinate among themselves to form a hierarchical communication network whose architecture is determined by $\mathrm{HN}_p^w(V)$, where the weight function $w$ depends on the battery power available at a node at any point of time. Nodes at higher levels of the hierarchy are depleted of their energy more quickly, and so there is heterogeneity in the power profile of the network. We periodically reform the network to distribute the energy load according to the residual energy at each node. Once the network topology has been formed, each sensor node constantly monitors its environment and periodically sends the data up the hierarchy to its parent. The nodes at the topmost level of $\mathrm{HN}_p^w(V)$ communicate directly with the BS.



Sensor data is often highly correlated locally, so we aggregate data, reducing the number of messages transmitted to the BS and hence improving the energy efficiency of the network. We consider two different applications of WSNs and the corresponding data aggregation models used. (a) Limited Aggregation: Only data signals from nodes located close to each other are highly correlated and can be aggregated into a single signal. We believe that in $\mathrm{HN}_p^w(V)$, nodes which share a common parent are located close to each other and hence their data signals are correlated. In this case, a bounded number of data signals are fused into a single data signal. (b) Unlimited Aggregation: All data signals, irrespective of location, can be fused to get a single signal. This model is valid for applications in which we are interested in quantities like the average, min or max of a set of values, e.g. radiation level monitoring in a nuclear plant, where the most useful information for the safety of the plant is the maximum value [TH]. In this case, all data signals at a relaying node are fused into a single signal.

Network setup and operation. We assume that each node has the computational ability to support MAC protocols and perform the signal processing functions required. To begin with, the nodes of $V$ organize themselves into $\mathrm{HN}_p^w(V)$. The weight function $w$ is given by $w(u) = \mathrm{batt}_u / \mathrm{batt}_{th}$, where $\mathrm{batt}_u$ is the residual battery of node $u$ and $\mathrm{batt}_{th}$ is the threshold energy below which a node is declared dead. All nodes $\{u \text{ s.t. } \mathrm{lev}(u) = 0\}$ periodically transmit data to their parents. All other nodes receive data from all their children, fuse the incoming data signals along with their own data according to the data aggregation model and transmit to their parent. The nodes at the topmost level transmit data to the BS. Nodes $\{u \text{ s.t. } \mathrm{lev}(u) = 0\}$ switch off to conserve energy when they are not transmitting. The operation of the network is divided into rounds. The network is reformed at the end of each round, which has a fixed duration. We assume collision-free traffic in the network. This requires each node in $\mathrm{HN}_p^w(V)$ to create a TDMA schedule and communicate it to its children. In addition, the BS distributes spreading codes for direct-sequence spread spectrum (DSSS) [5] among the nodes in the network.

Simulations and Discussion. In this section we simulate $\mathrm{HN}_p^w(V)$ and compare it to LEACH [5]. We compare the energy consumption, network lifetime and the effective data throughput received at the BS. The effective data throughput is measured by the number of data signals represented by the aggregated signal received at the BS. $\mathrm{HN}_p^w(V)$ was observed to transmit 8 times more effective data than LEACH when using unlimited data aggregation, and 2 to 3 times more when using limited data aggregation.

Now we describe the simulation environment and parameters. We mimicked LEACH's setup in order to compare our structure with theirs [5]. We initialized a network with 100 sensor nodes, spread uniformly over a square region $A$ of side 100 units, each starting with the same energy of 2 J. We considered a node to be dead if its energy dropped below 0.1 J. We constructed $\mathrm{HN}_p^w(V)$, with parameter $p = 0.5$, periodically at intervals of 20 s, which is the duration of one round. The BS is located close to $A$ but outside it. The bandwidth of the channel was taken to be 1 Mb/s, and each data signal was taken to be 500 bytes long with a 25 byte packet header for each type of packet. We simulated this scenario and compared our mechanism against LEACH, observing energy consumption and throughput in the network.

In LEACH, for an $N = 100$ node network with $k = 5$ cluster heads, approximately $N/k = 20$ signals are fused into a single signal at each cluster head, assuming a uniform distribution. We simulated our setup for the same 20:1 compression ratio, as well as for a lower compression ratio of 10:1. In the second data aggregation model, each node fuses all signals present into a single signal and then transmits it to its parent.

Nodes $\{u \text{ s.t. } \mathrm{lev}(u) = 0\}$ sense data and send it to their respective parents. Nodes at level $i$ ($\ge 1$) receive data from all their children, process the data to remove redundancy, and forward the effective data to their parent. This cycle continues up to the highest level, from which data is sent to the BS. Nodes at the lowest level usually spend less energy, whereas those at higher levels are quickly depleted of their battery power.

For our simulation we assume a simple model for energy dissipation (the same model as used to study LEACH in [5]). To transmit $l$ bits of data over a distance $d$, the energy dissipated is $E_{Tx}(l, d) = l E_{elec} + l \epsilon_{fs} d^2$, and to receive an $l$-bit message it is $E_{Rx}(l) = l E_{elec}$. The radio electronics energy, $E_{elec} = 50$ nJ/bit, depends on the coding and spreading of the signal, and the amplifier constant, $\epsilon_{fs} = 10$ pJ/bit/m$^2$, depends on the acceptable bit-error rate. In addition, energy is consumed for data aggregation, which is taken to be $E_{DA} = 5$ nJ/bit/signal. We work on the assumption that the nodes are capable of controlling their power in order to vary the transmission radius.
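The energy model can be written down directly. The sketch below uses the constants quoted above; the per-round accounting in `relay_round_energy` is an illustrative assumption (a relaying node receives one packet per child, fuses the children's signals together with its own, and forwards a single packet, with the header counted in every packet), not a description of the simulator used for the figures. Distances are taken to be in metres, and the function names are hypothetical.

```python
E_ELEC = 50e-9    # radio electronics energy, J/bit
EPS_FS = 10e-12   # amplifier constant, J/bit/m^2
E_DA = 5e-9       # data aggregation energy, J/bit/signal


def tx_energy(bits, distance):
    """Energy to transmit `bits` over `distance`: l*E_elec + l*eps_fs*d^2."""
    return bits * E_ELEC + bits * EPS_FS * distance ** 2


def rx_energy(bits):
    """Energy to receive a message of `bits` bits: l*E_elec."""
    return bits * E_ELEC


def aggregation_energy(bits, signals):
    """Energy to fuse `signals` signals of `bits` bits each."""
    return bits * E_DA * signals


def relay_round_energy(num_children, d_parent,
                       payload_bytes=500, header_bytes=25):
    """Assumed cost of one reporting cycle at a relaying node under
    unlimited aggregation: receive from every child, fuse all signals
    (children plus own), and send one packet to the parent."""
    bits = 8 * (payload_bytes + header_bytes)
    receive = num_children * rx_energy(bits)
    fuse = aggregation_energy(bits, num_children + 1)
    send = tx_energy(bits, d_parent)
    return receive + fuse + send


if __name__ == "__main__":
    packet_bits = 8 * 525
    print(tx_energy(packet_bits, 50.0))   # about 3.15e-4 J
    print(relay_round_energy(3, 50.0))    # about 1.03e-3 J
```

Because the transmission term grows with $d^2$, a node with a 2 J budget can afford far fewer long-range transmissions to the BS than short hops to a nearby parent, which is the effect the hierarchy exploits.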

Our simulation results are presented in Figures 7, 8 and 9. The data points for LEACH are taken from [5]. When using unlimited data aggregation ("$\mathrm{HN}_p^w(V)$ Unlimited" in the plots), the throughput (Figure 7) and the ratio of data sent to energy consumed (Figure 8) are much higher than for LEACH, which is expected since our structure aggregates at several levels. Even when we limit the amount of aggregation allowed (in the plots "$\mathrm{HN}_p^w(V)$ Limited ≤ 20" represents a 20:1 compression and "$\mathrm{HN}_p^w(V)$ Limited ≤ 10" represents 10:1), $\mathrm{HN}_p^w(V)$ outperforms LEACH in these two aspects. With network lifetime the picture is a little more complex. In Figure 9 we see that in $\mathrm{HN}_p^w(V)$ the time at which the last node dies is more or less the same as in LEACH, although LEACH loses a lot of nodes suddenly while $\mathrm{HN}_p^w(V)$ degrades slowly. In conclusion, it appears from the simulation that our structure does not dominate LEACH in terms of lifetime but uses energy more efficiently and provides a higher throughput than LEACH does, running far ahead in the case of highly compressible data.

Figure 7: Total amount of data received at the BS over time.

Figure 8: Total amount of data received at the BS per given amount of energy.

Figure 9: Number of nodes alive over time.